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Abstract 

We consider the nonlinear integer programming problem of minimizing a 
quadratic function over the integer points in variable dimension satisfying a 
system of linear inequalities. We show that when the Graver basis of the ma- 
trix defining the system is given, and the quadratic function lies in a suitable 
dual Graver cone, the problem can be solved in polynomial time. We discuss 
the relation between this cone and the cone of positive semidefinite matrices, 
and show that none contains the other. So we can minimize in polynomial 
time some non-convex and some (including all separable) convex quadrics. 

We conclude by extending our results to efficient integer minimization of 
multivariate polynomial functions of arbitrary degree lying in suitable cones. 

1 Introduction 

Consider the general nonlinear integer minimization problem in standard form, 

min {/(x) : X eZ"^ , Ax = b, I <x <u} , (1) 

with A e Z*"^", 6 G Z™, /, M G ZJ^ with Zoo := Z W {±oo}, and / : M" ^ M. 

It is well known to be NP-hard already for linear functions. However, recently it 
was shown that, if the Graver basis Q{A) of A is given as part of the input, then the 
problem can be solved in polynomial time for the following classes of functions. First, 
in [1], for composite concave functions f{x) = g{Wx), with W G Z"^^", (7 : M"^ — )■ M 
concave, and d fixed. Second, in [3], for separable convex functions f{x) = J2i fii^i) 
with each /j univariate convex, and in particular for linear functions f{x) = x. 
While the Graver basis is a complex object, it can be computed in polynomial time 
from A for many natural and useful classes of matrices as demonstrated in [H |3]. 
Moreover, the results of [2j imply that there is a parameterized scheme that enables 
to construct increasingly better approximations of the Graver basis of any matrix A 
and obtain increasingly better approximations to problem ([T]), see [1] for details. 

In this article we continue this line of investigation and consider problem ([T]) for 
quadratic functions f{x) = x^Vx + x + a with V G M"^", w G M", and a G M. 
We also discuss extensions to multivariate polynomial functions of arbitrary degree. 

We begin by noting that problem ([1]) remains NP-hard even if the Graver basis 
is part of the input and even if the objective function is quadratic convex of rank 1. 
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Proposition 1.1 It is NP-hard to determine the optimal value of the problem 

mm \^x^Vx + w'' X + a : x G Z" , Ax = b , I < x < (2) 
even when Q{A) is given and the function is convex quadratic with matrix V = vv^ . 

Proof. Let v G and vq G Z+ be input to the subset sum problem of deciding if 
there exists x G {0, 1}" with x = Vq. Let A := he the zero 1 x n matrix, whose 
Graver basis Q{A) = {±lj : i = 1, . . . ,n} consists of the n unit vectors and their 
negations. Let / := and -u := 1 be the zero and all-ones vectors in Z", and let 
6 := in Z™. Let V := vv'' , w := —2vqV, and a := Vq. Then problem ([2]) becomes 

min I (t>^x — fo)^ : x G {0, , 
whose optimal value is if and only if there is a subset sum, proving the claim. □ 

This shows that to solve problem ([2]) in polynomial time, even when the Graver 
basis is given, some restrictions on the class of quadratic functions must be enforced. 

In Section 2 we introduce the quadratic Graver cone Q{A), which is a cone 
of n X n matrices defined via the Graver basis of A, and the diagonal Graver cone 
T^IA) which is the diagonal projection of Q{A) into M" . We discuss some elementary 
properties of these cones and their duals Q*{A) and V*{A) and give some examples. 

In Section 3 we prove the following algorithmic result about the solvability of 
problem ([1]) for every quadratic function (possibly indefinite, neither convex nor 
concave) whose defining matrix lies in the dual quadratic Graver cone. 

Theorem 1.2 There is an algorithm that, given G{A), solves the quadratic problem 

mm \^x^Vx + w'' X + a : x G Z" , Ax = b , I < x < (3) 

in polynomial time for every integer matrix V lying in the cone Q*{A) dual to Q{A). 

We point out that, in practice, the algorithm that underlies Theorem 11.21 can be 
applied to any quadratic function. The algorithm will always stop and output a 
feasible solution if one exists, which can be used as an approximation of the optimal 
one. And, whenever V lies in Q*{A), the solution produced will be true optimal. 
As a special case we obtain the following result on separable quadratic functions. 

Theorem 1.3 There is an algorithm that, given G{A), solves the separable problem 

n 

min{y~] [vix'^ + WiXi + aij : x G Z" , Ax = b , I < x < u} (4) 

i=l 

in polynomial time for every integer vector v lying in the cone V*{A) dual to T>{A). 
In particular, this applies to any convex separable quadratic, that is, with v G Z^. 
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In particular, Theorem II .31 enables us to solve the problem with any linear objective 
function f{x) = w''x, which is the special case with v = 0, which is always in V*{A). 

In Section 4 we proceed with a discussion of the relation between the dual 
quadratic Graver cone Q*{A) and the cone S!l of symmetric positive semidefinite 
matrices, and establish Theorem 14.21 which provides a characterization, in terms of 
their matroids only, of those matrices A for which the dual diagonal Graver cone 
V*{A) strictly contains MJ^ and for which Theorem 11.31 assures efficient solution of 
problem (j4]) for all separable convex as well as some nonconvex quadratic functions. 

In the final Section 5 we extend our results to multivariate polynomial functions 
of arbitrary degree. We define a hierarchy of higher degree analogues Vk{A) of the 
quadratic Graver cone, and show that the iterative algorithm of Theorem 11.21 solves 
the polynomial integer minimization problem ([1]) in polynomial time for every degree 
d form / that lies in a cone }Cd{A) defined in terms of the dual Graver cones Vl{A). 

Theorem 1.4 For every fixed d there is an algorithm that, given Q{A), solves 

min{/(x) : X G Z", Ax = 6, X > 0} (5) 
in polynomial time for every degree d integer homogenous polynomial f in ICd{A). 

2 The quadratic and diagonal Graver cones 

We begin with some notation. The inner product of two m x n matrices U, V is 
U-V := ^ • J UijVij. The diagonal of n x n matrix V is the vector v := diag(\^) G M" 
defined by Vi := Vi^i for all i. For u G we denote by U := Diag(u) the n x n 
diagonal matrix with diag(f/) = u. The pointwise product of vectors g,h & is the 
vector goh inW^ with {goh)i := gihi for all i. Note that g,h lie in the same orthant 
of M" if and only if g o h > 0. The tensor product of g,h & is the n x n matrix 
g ®h = gh^ with {g ® h)ij := {gh^)ij = gihj for all i,j. We will use the notation 
g ® h and gh' interchangeably as we find appropriate. Note that for all g,h E MP 
and V G M"^", we have g ok = diag(5' ® h) and {g ® h) ■ V = g^Vh. 

Any quadratic function /(x) = x^V"x + w''x + a has an equivalent description 
/(x) = x^Ux + w''x + a with U := ^{V + V'') symmetric matrix. We therefore 
can and will be working with symmetric matrices which are much better behaved 
than arbitrary square matrices. We denote by 5" C M"^" the linear subspace of 
symmetric n x n matrices. A cone is a subset V of real vector space such that 
ax + (3y E V for all x,y E V and a,(3 E M+. The cone generated by a set V of 
vectors is the set cone(V) of nonnegative linear combinations of finitely many vectors 
from V. In particular, cone(0) := {0}. We will be using cones P C of vectors 
and cones Q C 5" of n x n symmetric matrices. The dual of a cone D C and 
the (symmetric) dual of a cone Q C S"' are, respectively, the cones 

V* := G : uv>0,uEV}, Q* := {V E S"" : U -V >0, U E Q}. 

Duality reverses inclusions, that is, if P C /C are cones in or 5" then /C* CP*. 
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We proceed with the definition of the Graver basis of an integer matrix. The 
lattice of an integer m x n matrix A is the set C{A) := {x G : Ax = 0}. We 
denote by C*{A) the set of nonzero elements in C{A). We use a partial order □ on 

which extends the coordinate-wise partial order < on the nonnegative orthant 
M" and is defined as follows. For x,y G M" we write x ^ y and say that x is 
conformal to y if x o y > (that is, x,y lie in the same orthant) and \xi\ < \yi\ for 
all i. We write x \Z y if x ^ y and x ^ y. A simple extension of the classical Gordan 
Lemma implies that every subset of Z" has finitely many C-minimal elements. 

Definition 2.1 The Graver basis of an integer matrix A is defined to be the finite 
set g{A) C Z" of C-minimal elements in C*{A) = {x G Z" : Ax = 0, x^ 0}. 

In this article we introduce the following objects defined via the Graver basis. 

Definition 2.2 The quadratic Graver cone of an integer m x n matrix A is defined 
to be the cone Q{A) C 5" of n x n matrices generated by the matrices g®h + h®g 
over all pairs of distinct elements g,h E G{A) that lie in the same orthant, that is, 

Q{A) := cone {g ® h + h ® g : g,he g{A) , gy^h,goh>0} C 5". 

The dual quadratic Graver cone is its (symmetric) dual Q*{A) in 5" given by 

Q*{A) = {^G^" : U-V>0, UeQ{A)} (6) 
= {VeS^ : igh' + hg')-V > 0, g^heGiA), g ^ h , g o h > 0} 
= {1/g5" : g'Vh > 0, g,hEg{A), g^h, goh>0}. 

We are also interested is the following cone of diagonals of matrices in Q{A). 

Definition 2.3 The diagonal Graver cone of A is the cone of nonnegative vectors 

V{A) := cone {g o h : g,he g{A) , g^h,goh>0} C M^. 

The dual diagonal Graver cone is its dual V*{A) in M" given by 

V*{A) = {v : uv>0, ueV{A)} (7) 
= {v : (gohYv > 0, g,hegiA), g ^ h , g o h > 0} 
= {v : ^gihiVi > 0, g,heg{A), g^h, goh>0}. 

The following lemma provides some basic relations among the above cones and 
more. All inclusions can be strict, as is demonstrated in Examples 12. 51 and 12. 6l below. 
In particular, it is interesting to note that T>{A) is the diagonal projection of Q{A), 
but V*{A) is generally strictly contained in the diagonal projection of Q*{A). 

Lemma 2.4 The quadratic and diagonal Graver cones and their duals satisfy 

Rl D V{A) = {diag(f/) : U e Q{A)} D {u : Diag(n) G Q(A)} , (8) 
Rl C V*{A) = {v : Diag{v) e Q*{A)} C {diag(r) : V E Q*{A)} . 
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Proof. First, 'P(A) C M" because it is generated by nonnegative vectors. Therefore 
V*{A) 3 (M")* = M". To establish the top equahty note that the following are 
equivalent: u G T>{A); u = J^kf^kidk ° ^fe) for some suitable /i^ > 0, gk, G Q{A.); 
u = diag([/) with U = ^J^k f^^idk'^ hk + hk® gk)', and u = diag(f/) with U E Q{A). 
To establish the bottom equality note that the following are equivalent: v G V*{A)] 
(g o hYv > for all suitable g,h E GiA); V = Diag(f) with g^Vh > for all 
g, h; and V = Diag(f ) with V E Q*{A). The two remaining inclusions on the right- 
hand sides follow from diag(Diag(x)) = x. This completes the proof of the lemma. □ 

The next two examples show that all inclusions in Lemma [2.41 can be strict. 

Example 2.5 Consider the zero 1 x n matrix A := 0, whose Graver basis is given 
by G{A) = {±lj : i = 1, . . . ,n} . Then g o h = is the zero vector for all distinct 
g,h E Q{A) in the same orthant. So the diagonal Graver cone and its dual are 
V{A) = {0} C Wl and V*{A) = W DR^ so the left inclusions m ^ are strict. 

Example 2.6 Consider the 1x3 matrix A := {1 1 1) with Graver basis Q{A) = 
±{(1, —1, 0), (1, 0, —1), (0, 1, —1)}. The quadratic Graver cone and its dual satisfy 



Q{A) = cone 




a-d-e+f > 
Q*iA) = {[ d b f \ : b-d + e-f > 




c+d-e-f > 

2aa + ba + c\ 
^ { \ a + b 26 6 + c : a, 6, c G M > . (9) 
a + c b + c 2c J J 

The diagonal Graver cone and its dual are T>{A) = M" and V*{A) = W^. Therefore, 
the top and bottom inclusions on the right-hand side of equation ^ are strict, 

V{A)=Wl D {0} = {u : Diag(M) G 

V*{A)=Wl C = {diag(V) : VeQ*{A)}. 



3 Quadratic integer minimization 

We proceed to establish our algorithmic Theorems 11.21 and 11.31 We focus on the 
situation of finite feasible sets, which is natural in most applications. But we do allow 
the lower and upper bounds l,u E ZJ^ to have infinite components for flexibility of 
modeling (for instance, it is quite common in applications to have k = and Ui = oo 
for all i, with the resulting feasible set typically still finite). We also require our 
algorithms to identify and properly stop when the set is infinite. So in all algorithmic 
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statements, an algorithm is said to solve a (nonlinear) discrete optimization problem, 
if for every input, it either finds an optimal solution, or asserts that the problem is 
infeasible or the feasible set is infinite. We begin with a simple lemma that shows 
that we can quickly minimize a given quadratic function in a given direction. 

Lemma 3.1 There is an algorithm that, given bounds l,u & ZJ^, direction g G Z", 
point 2; G Z" with I < z < u, and quadratic function f{x) = x^Vx + x + a with 
V G Z""^", w G Z", and a G Z, solves in polynomial time the univariate problem 

min{/(z + fig) : jJ, E 2,+ , I < z + jig < u}. (10) 

Proof. Let S := {jj, E 2,+ : I < z + jig < m}, and let s := sup S which is easy to 
determine. If s = 00 then we conclude that S is infinite and stop. Otherwise we need 
to minimize the univariate quadratic function h{ji) := f{z + jig) = h2ji^ + /ii/i + /iq 
with /12 := g^Vg, hi := z^Vg + g^Vz + w''g, and Hq := z'^Vz + z + a over 
S" = {0, 1, . . . , s}. If /i2 < 0, then h is concave, and the minimum over S is attained 
at /i = or /i = s. If /12 > then h is convex with real minimum at /i* := — 
Then minimizing h over S reduces to minimizing h over S fl {0, \_ji*\ , [/i*] , s}. □ 

A finite sum u := ^jfj of vectors in is called conformal if f j □ m for all i, 
and hence all summands lie in the same orthant. The following lemma shows that 
quadratic / with defining matrix in the dual quadratic Graver cone is supermodular 
on conformal sums of nonnegative combinations of elements of the Graver basis. 

Lemma 3.2 Let A be any integer m x n matrix with quadratic Graver cone Q{A). 
Let / : — i- M 5e any quadratic function f{x) = x'^Vx + x + a with V G Q*{A). 
Let X G M" &e any point, and let ^ jiigi be any conformal sum in M" with gi G Q{A) 
distinct elements in the Graver basis of A and fii > nonnegative scalars. Then 

A := [f{x + J2^^^g^)-fi^)) - Y.^f{x + ii,gi)-f{x)) > 0. 
Proof. We have 

id 

and 

^{fix + ingi) - f{x)) = Y{x'Viiigi + ^aglVx + HiglV^ngi + w" lag^) . 
Therefore we obtain 

A = ^fXiglVfXjgj -^fXiglVfXigi = ^fiiglVfijgj = Y^ii^ijg]Vgj > 0, 
because gi, gj G G{A) satisfy gi o gj > and g^ 7^ gj for i ^ j, and V is in Q*{A). □ 



We need two more useful properties of Graver bases. First we need the following 
integer analogue of Caratheodory's theorem of [B] which we state without proof. 
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Lemma 3.3 Let A be an integer m x n matrix, and let Q{A) be its Graver basis. 
Then every x G £* (A) is a conformal sum x = Yll=i l^idi ^^^^ involves t < 2n — 2 
Graver basis elements Qi G Q{A) and nonnegative integer coefficients fii G Z+. 

The next lemma provides a Graver basis criterion for finiteness of integer programs. 

Lemma 3.4 Let Q{A) be the Graver basis of matrix A, and let l,u E ZJ^. // there 
is some g G G{A) satisfying gi < whenever ui < oo and gi > whenever U > —oo 
then every set of the form 5 := {x G Z" : Ax = b , I < x < u} is either empty or 
infinite, whereas if there is no such g, then every set S of this form is finite. Glearly, 
given the Graver basis, the existence of such g can be checked in polynomial time. 

Proof. Suppose there is such g and consider such S containing a point x. Then for 
all A G Z+ we have I < x + Xg < u and A{x + Xg) = Ax = b, and hence x + Xg E S so 
S is infinite. Next suppose S is infinite. Then P := {x G M."^ : Ax = b, I < x < u} is 
unbounded, and hence has a recession vector, which we may assume is integer, that 
is, a nonzero h such that x + ah G P for all x G P and a > 0. Then h G C*{A) and 
hi < whenever < oo and hi > whenever /j > — oo. By Lemma [3. 3 [ the vector 
h is a conformal sum h = ^(yfj of vectors gi G Q{A), each of which also satisfies 
gi < whenever Ui < oo and gi>Q whenever li > — oo, providing such g. □ 

Next we prove the main lemma underlying our algorithm, which shows that, 
given the Graver basis, and an initial feasible point, we can minimize a quadratic 
function with defining matrix in the dual quadratic Graver cone in polynomial time. 

Lemma 3.5 There is an algorithm that, given A G Z"^^", its Graver basis Q{A), 
bounds l,u E Z^, point 2; G Z" with I < z <u, and quadratic /(x) = x^Vx + w^x + a 
with integer V G Q*{A), w G Z", and a G Z, solves in polynomial time the program 

min{/(x) = x^Vx + w'' x + a : x G Z" , Ax = b , I < x < u} , b := Az . (11) 

Proof. First, apply the algorithm of Lemma [3.41 to Q{^Al) and /,u and either detect 
that the feasible set is infinite and stop, or conclude it is finite and continue. Next 
produce a sequence of feasible points Xo,Xi,...,Xs with xq := z the given input 
point, as follows. Having obtained x^, solve the minimization problem 

min{/(xfc + /ifif) : /i G Z+ , 51 G Q{A) , / < x^ + /i^f < m } (12) 

by applying the algorithm of Lemma [3. II for each g G Qi^-A). If the minimal value in 
(fT2|) satisfies /(x^ + [ig^ < f{xk) then set Xk+i := Xk + fig and repeat, else stop and 
output the last point Xg in the sequence. Now, Ax^+i = A{xk + Xg) = Axk = 6 by 
induction on k, so each x^ is feasible. Because the feasible set is finite and the x^ 
have decreasing objective values and hence distinct, the algorithm terminates. 

We now show that the point Xg output by the algorithm is optimal. Let x* be 
any optimal solution to flTT]) . Consider any point x^ in the sequence, and suppose 
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that it is not optimal. We claim that a new point Xk+i will be produced and will 
satisfy 

f{x,^,)-f{x*) < ^ (/(x,) - /(x*)) . (13) 

By Lemma 13.31 we can write the difference x* — Xk = Yll=i f^i9i conformal sum 
involving 1 < t < 2n — 2 elements Qi G G{A) with all /ij G Z4.. By Lemma [3.21 

\ i=l J 1=1 

Adding t {f{xk) — f{x*)) on both sides and rearranging terms, we obtain 

t 

Y^(f(xk + fi^g.)-f{x*)) < (t-l)(/(x,)-/(x*)). 

i=l 

Therefore there is some summand on the left-hand side satisfying 

f — 1 2r) — 3 

f{x, + f^.g,)-f{x*) < -—(fix,)- fix*)) < ^--^{f{x,)-f{x*)). 

So the point Xk + fJ^g attaining minimum in (fT2l) satisfies 

2r? — 3 

f(xk + fig)-f{x*) < f(xk + fiig^)-f{x*) < __ (/(xfc) - /(x*)) , 

and so indeed Xk+i ■= Xk + j^g will be produced and will satisfy ( |T3|) . This shows 
that the last point Xg produced and output by the algorithm is indeed optimal. 

We proceed to bound the number s of points. Consider any i < s and the 
intermediate non-optimal point Xi in the sequence produced by the algorithm. Then 
/(xj) > /(x*) with both values integer, and so repeated use of (fT3l) gives 

and therefore 

^ < (log^^TTT^) log(/(x)-/(x*)). 

Therefore the number s of points produced by the algorithm is at most one unit 
larger than this bound, and using a simple bound on the logarithm, we obtain 

s = 0(nlog(/(x)-/(x*))). 

Thus, the number of points produced and the total running time are polynomial. □ 



Next we show that, given the Graver basis, we can also find an initial feasible 
point for assert that the given set is empty or infinite, in polynomial time. 
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Lemma 3.6 There is an algorithm that, given integer m x n matrix A, its Graver 
basis Q{A), l,u & ZJ^, and b G Z™, in polynomial time, either finds a feasible point 
in the set S := {x ^ Z" : Ax = b, I < x < u} or asserts that S is empty or infinite. 

Proof. Assume that / < u and that Ij < oo and Uj > — oo for all j, because otherwise 
there is no feasible point. Also assume that there is no (7 G Q{A) satisfying qj < 
whenever uj < 00 and gj > whenever Ij > —00, because otherwise S is empty or 
infinite by Lemma 13.41 Now, either detect there is no integer solution to the system 
of equations Ax = b (without the lower and upper bound constraints) and stop, or 
determine some such solution a; G Z" and continue; it is well known that this can 
be done in polynomial time, say, using the Hermite normal form of A, see [5]. Let 

I ■= {j-lj<Xj<Uj} C {!,..., n} 

be the set of indices of entries of x that satisfy their lower and upper bounds. While 
/ C {1, . . . , ra} repeat the following procedure. Pick any index i ^ I. Then either 
Xi < li or Xi > Ui- We describe the procedure only in the former case, the latter 
being symmetric. Update the lower and upper bounds by setting 

Ij := min{/j, Xj} , Uj := ma.x{uj,Xj}, j = l,...,n. 

Solve in polynomial time the following linear integer program, for which x is feasible, 

max{2;j : 2; G Z" , Az = b , I < z < u , Zi < Ui}, (14) 

by applying the algorithm of Lemma [3.51 using the function f{z) := z^Oz + ijz + 
with V = the zero matrix which is always in Q*{A). Now Ij > —00 if and only 
if Ij > —00, and Uj < 00 if and only if Uj < 00. So there is no (? G G{A) satisfying 
gj < whenever Uj < 00 and gj > whenever Ij > —00, and hence the feasible set 
of ([H]) is finite by Lemma 13.41 and has an optimal solution z. If Zi < k then assert 
that the set S is empty and stop. Otherwise, set x := z, I := {j : Ij < Xj < Uj}, 
and repeat. Note that in each iteration, the cardinality of / increases by at least 
one. Therefore, after at most n iterations, either the algorithm detects infeasibility, 
or / = {1, . . . ,n} is obtained, in which case the current point x is feasible. □ 

We are now in position to establish our theorem. 

Theorem 11.21 There is an algorithm that, given A G Z™^", its Graver basis G{A), 
bounds l,u & ZJ^, b G Z™, integer matrix V G Q*{A) in the dual quadratic Graver 
cone, 1(7 G Z", and a G Z, solves in polynomial time the quadratic integer program 

mm{x^Vx + w'' X + a : x G Z" , Ax = b , I < x < u}. 

Proof. Use the algorithm underlying Lemma 13.61 to either detect that the problem 
is infeasible or that the feasible set is infinite and stop, or obtain a feasible point 
and use the algorithm underlying Lemma 13.51 to obtain an optimal solution. □ 
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An important immediate consequence of Theorem 11.21 is that we can efficiently 
minimize separable quadratic functions defined by vectors in the dual diagonal 
Graver cone. In particular, it applies to every convex separable quadratic function 
(which can also be deduced from the results of (3| on separable convex functions). 

Theorem 11.31 There is an algorithm that, given A e Z™^", its Graver basis Q{A), 
bounds l,u & ZJ^, b G Z™, integer vector v G I)*{A) in the dual diagonal Graver 
cone, and w,a & Z", solves in polynomial time the separable quadratic program 

n 

min{y~^ (vixf + WiXi + Oj) : a; G Z" , Ax = b , I < x < u}. 

8=1 

In particular, this applies to any convex separable quadratic, that is, with v G Z" . 

Proof. First, for any v G 'D*(A) we have V := Diag(f) G Q*{A) by Lemma EH 
Hence, by Theorem II. 2 [ we can minimize in polynomial time the quadratic function 

n n 

[Vix'^ + WiXi + Oj) = X^Vx + W^X + Oj . 
i=l 1=1 

Second, if the separable quadratic function is convex, which is equivalent to its 
defining vector v being nonnegative, then v G M" C V*{A) by Lemma [2.41 again. 
Hence the second statement of the theorem now follows from the ffist statement. □ 



4 Nonconvex solvable quadratics and matroids 

Consider the quadratic minimization problem, with the Graver basis of A given, 

min {/(x) = x^Vx + w'' x + a : x G Z*^ , Ax = b , I < x < . (15) 

The function / is convex if and only if its defining matrix V is positive semidefinite, 
that is, if X Vx > for all x G M'^. Let C 5" denote the cone of symmetric 
positive semidefinite matrices. Now, on the one hand, if V E Q*{A) then, by 
Theorem 11.21 we can solve problem f|T5l) efficiently. On the other hand, if \^ G 5" 
then / is convex, and problem f[T^ may seem to be easier, but remains NP-hard even 
for rank-1 matrices V = vv^ G 5" by Proposition 11.11 So it is unlikely that Q*{A) 
contains S^, and it is interesting to consider the relation between these matrix cones. 

For this, we need a couple of basic facts about positive semidefinite matrices. 
First, Note that for any vector u G M", the rank-1 matrix uu is in tS" because 
x^{uu)x = (u^x)^ > for all x G M", whereas for any two linearly independent 
vectors g,h E M", the rank-2 matrix gh' + hg^ is in S'^\S'^ because there is an x G M" 
with g^ X = 1 and x = —1 and hence x^{gh^ + hg^)x = 2{g^ x){h' x) = —2 < 0. 
Second, the cone of symmetric positive semidefinite matrices is self dual, that is, 
{SD* = SI- To see this, note that if ?7 G 5" \ 5!^ then there is an x G M" with 
{x®x)-U = x^Ux <OsoU ^ (5")*; and if G 5" has rank r, then V = X^Li 
for some Xi G M" and hence U -V = YJ'i=i ^lUxi > for all U E S^, so V E (5^)*. 
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So we can conclude the following. In the rare situation where each orthant of 
M" contains at most one element of Q{A), we have Q{A) = {0} and Q*{A) = 5", so 
Theorem 11.21 enables to solve problem f|T5|) for any quadratic function. In the more 
typical situation, where some orthant does contain two elements g,h & d{A), the 
corresponding generator of Q{A) satisfies gh^ + hg^ G iS"\iS" and hence Q{A) ^ iS". 
By self duality of 5", we obtain 5" = (5")* ^ Q*{A). So we cannot solve problem 
( ITSj) for all convex quadratics, reflecting the NP-hardness of the convex problem. 
But we do typically also have Q*{A) ^ 5", that is, we can solve problem ( !T5|) in 
polynomial time for various nonconvex quadratics. For instance, in Example 12. 6[ the 
matrix in is not positive semidefinite for all a,b,c < 0. Moreover, by Lemma [231 
mi C V*{A) = {v : Diag(t;) G Q*{A)}, so Q*{A)\Sl whenever D*( A) ^ 

We proceed to discuss this diagonal case, where the function / is defined by a 
diagonal matrix V = Diag(f) for some v G M", that is, / is separable of the form 
/(x) = ^iivix'f + WiXi + tti). In this case, / is convex if and only if v is nonnegative. 
As noted in Lemma (2.41 the dual diagonal Graver cone 'D*{A) always contains the 
nonnegative orthant M". We proceed to characterize those matrices A for which 
this inclusion is strict, so that 'D*{A) \ M" ^ and Theorem 11.31 enables to solve 
problem (fT5|) in polynomial time also for various nonconvex separable quadratics. 

For this we need a few more definitions. A circuit of an integer matrix A is an 
element c G C*{A) whose support supp(c) is minimal under inclusion and whose 
entries are relatively prime. We denote the set of circuits of A by C{A). It is easy 
to see that for every integer matrix A, the set of circuits is contained in the Graver 
basis, that is, C{A) C Q{A). Recall that a finite sum u := Xlj^i of vectors in is 
conformal if f j □ u for all i, and hence all summands lie in the same orthant. The 
following property of circuits is well known. For a proof see, for instance, |1] or [7]. 

Lemma 4.1 Let A he an integer matrix. Then every x G C*{A) is a conformal sum 
X = ^iUiCi involving circuits Ci G C{A) and nonnegative real coefficients ai G IR+. 

It turns out that the matroid of linear dependencies on the columns of the integer 
mxn matrix A (over the reals or integers) plays a central role in the characterization 
we are heading for. A matroid- circuit is any set C C {1, . . . , n} that is the support 
C = supp(c) of some circuit c G C{A) of A. Note that a circuit c is in C{A) if 
and only if its antipodal — c is, and if c, e G C{A) are circuits with c ^ ±e then 
supp(c) 7^ supp(e). We denote the set of matroid-circuits of A, that is, the set of 
supports of circuits in C{A), by J^{A) := {supp(c) : c G C{A)}, and refer to it 
simply as the matroid of A. For instance, for the 1x3 matrix A := [1 2 1) we have 

C{A) = ±{(2, -1,0), (0,-1,2), (1,0,-1)}, M{A) = {{1, 2}, {2, 3}), {1, 3}} . 

We now characterize those matrices A for which V*{A) strictly contains M.^. 

Theorem 4.2 The dual diagonal Graver cone of every integer mxn matrix A 
satisfies V*{A) ^ M", and the inclusion is strict if and only if there is 1 < k < n 
such that C n E ^ {k} for every two distinct matroid-circuits C,E & Ai{A) of A. 

Proof. We prove the dual statements about the diagonal Graver cone. By definition 
T>{A) C M", and the inclusion is strict if and only if some unit vector 1^ is not in 
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'D{A). Therefore it suffices to prove that, for any 1 < A; < ra, we have 1^ G 'P(A) if 
and only if there are two distinct matroid-circuits C,E & Ai{A) with C (1 E = {k}. 

Suppose ffist C,E E A4.{A) are distinct matroid-circuits with C r\ E = {k}. 
Then there are c, e G C{A) with c 7^ ±e such that supp(c) = C and supp(e) = E. 
Replacing e by — e G C{A) if necessary we may assume that CfcCfc > 0. Then coe > 0, 
c 7^ e, and c, e G Q{A) imply that c o e = CfcCfcl/c is a generator of T>{A), and hence 
Ifc G ^^(^)- Conversely, suppose Ifc G ^^(^)- Because C M", some nonneg- 

ative multiple of Ik must be one of the generators. So there are g,h & G{A) with 
g o h > and g ^ h such that g o h is a nonnegative multiple of 1^, and hence 
supp((7) n supp(/i) = {k}. By Lemma WAl we have (7 = Ylii^i^i ^ = Ylij^j^j 
conformal sums of circuits with nonnegative coefficients. Then supp(5') = U supp(ci) 
and supp(/;.) = Usupp(ej), and hence there are q and Cj among these circuits such 
that supp(cj) n supp(ej) = {k}. Let C := supp(ci) and E := supp(ej) be the cor- 
responding matroid-circuits of A. It remains to show that C and E are distinct. 
Suppose indirectly that C = E. Then C = E = Cr\E = {k}. This implies that the 
k-th column of A is and q = ej = ±1^. But then Ci ^ g and ej C h, and therefore 
g = Ci = Cj = h which is a contradiction. So C E, and the proof is complete. □ 

It is interesting to emphasize that the characterization in Theorem 14.21 is in terms 
of only the matroid of A, that is, the linear dependency structure on the columns of 
A. The algorithm of Theorem 11.31 enables to solve in polynomial time the program 

n 

min{^^ [vix'^ + WiXi + Oj) : a; G Z" , Ax = b , I < x < u} 

i=l 

for all separable quadratics with v G V*{A) and in particular for all separable convex 
quadratic functions with v G M" . So the algorithm can solve the program moreover 
for some separable nonconvex quadratic functions precisely when the matroid of A 
satisfies the criterion of Theorem 14.21 Here are some concrete simple examples. 

Example 4.3 Consider again Example \2.5\ with A := the zero Ixn matrix having 
Graver basis G{A) = {±lj : i = 1,. . . ,n}. Then the set of matroid-circuits of A is 
M{A) = {{l},...,{n}}. Therefore C HE = ^ for all distinct C,E e M{A) and the 
condition of Theorem \4^ trivially holds, so V*{A) 3 In fact, here V*{A) = M". 

Example 4.4 Directed graphs. Let G be a directed graph, and let A be its V x E 

incidence matrix, with A^^^ := 1 if vertex v is the head of directed edge e, A^^^ '■= ~1 
if V is the tail of e, and A^^e '■= otherwise. The set Ai{A) of matroid-circuits 
consists precisely of all subsets G ^ E that are circuits of the undirected graph 
underlying G. The setC{A) of circuits consists of all vectors c G {—1, 0, l}'^ obtained 
from some matroid circuit G E by choosing any of its two orientations and setting 
Cf. := 1 if directed edge e G C agrees with the orientation, Cg := —1 if e disagrees, 
and Ce := if e ^ G. The Graver basis is equal to the set of circuits, Q{A) = C{A). 
By Theorem \4.2\ we have T>*[A) D M;^ if and only if there is an edge e & E such that 
no two distinct circuits G, G' of the underlying undirected graph satisfy GCiG' = {e}. 
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Example 4.5 Generic Matrices. Let A be a generic integer m x n matrix, that 
is, a matrix for which every set ofm columns is linearly independent, say, the matrix 
defined by Aij := j* for all i,j, whose columns are distinct points on the moment 
curve in M™. Then the matroid of A is uniform, that is, its matroid- circuits are 
exactly all (m + l)-subsets of {l,...,n}. Suppose n < 2m. Then every distinct 
C,E e M{A) satisfy \C n E\ > 2, and hence V*{A) D by Theorem^ So, by 
Theorem 

n 

[vix"^ + WiXi + flj) : X G , Ax = b , I < x < u} 



min-^ 



can be solved in polynomial time for all such A, all b G and l,u & ZJ^, all convex 
and some nonconvex separable quadratic functions defined by data v,w,a & Z". 



5 Higher degree polynomial functions 

The algorithm that underlies our algorithmic Theorem 11.21 using the Graver basis is 
conceptually quite simple. First, it finds in polynomial time a feasible point. Then 
it keeps improving points iteratively, as long as possible, where, at each iteration, 
it takes the best possible improving step attainable along any Graver basis element. 
It outputs the last point from which no further Graver improvement is possible. 

We now proceed to show that the results of the previous sections can be extended 
to multivariate polynomials of higher, arbitrary, degree. We will define a hierarchy 
of cones, and whenever a polynomial function will lie in the corresponding cone, the 
algorithm outlined above will converge to the optimal solution in polynomial time. 

It will be convenient now to make more extensive use of tensor notation, and to 
work with the tensored, nonsymmetrized form of a polynomial function. We use 

<S)dW :=]R"(g)---®]R" , := X (g) ■ ■ ■ x eW 

for the (i-fold tensor product of with itself and for the rank-1 tensor that is the 
(i-fold product of a vector x with itself, respectively. Note that the {ii, . . . ,id)-th 
entry of ®da; is the product ■ ■ ■ of the corresponding entries of x. We denote 
the standard inner product on the tensor space by 

n n 

{U,V) := ^ • • • J] ?7.,...,,V^,„...,, , U,V e ®,MV 
*i=i *d=i 

In particular, in the vector space M" we have {x,y) = x^y and in the matrix space 
(g) M" we have {U, V) = U ■ V. Note that for any two rank-1 tensors we have 

d 
k=l 

For simplicity, we restrict attention to homogeneous polynomials, also termed 
forms. A form /(x) of degree d in the vector of n variables x = (xi, . . . , x„) can be 
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compactly defined by a single tensor F G ^dlR" that collects all coefficients, by 

n n 

fix) := {F,0dx) = J^--- J^Fi,„..,i^a;i, ■■■Xi, . 

ii=l «iJ=l 

For instance, the form f{x) = (xi + X2+ Xa)^ of degree ci = 3 in n = 3 variables can 
be written as f{x) = {F,^^x) = {^^Ij^^x) = with 1 the all-ones vector in 

and F = the all-ones tensor in ^sM'^, with Fi^^i^^i.^ = 1 for ii,i2,i3 = 1,2,3. 

Let A be any integer m x n matrix, and let Q{A) be its Graver basis. For each 
degree d > 2 we now define a cone Vd{A) in the tensor space as follows. 

Definition 5.1 The Graver cone of degree d of an integer m x n matrix A is the 
cone Vd{A) C (gj^^IR" generated by the rank-1 tensors g"^ ® ■ ■ ■ ® g'^ where the g'^ are 
elements of Q{A) that lie in the same orthant and are not all the same, that is 

Vd{A) := cone{5f^ ■ ■ ■ (g) g'^ : g' e Q{A), g^ o g^ > for all i,j, g' ^ g^ for some 

The dual Graver cone of degree d is its dual V^{A) in (S)^!^" given by 

V:{A) = {V e ®d^^ : {U,V)>0, UeVdiA)} = {V : {g' ® ■ ■ ■ ® g^^V) > 0, 

g^ G Q{A), g' o g^ > for all g' 7^ g^ for some i,]}. 

Note that V2{A) is the nonsymmetrized version of that is, Q{A) = V2{A)nS"' . 

One of the key ingredient in extending our algorithmic results to polynomials 
of arbitrary degree is the following analogue of Lemma 13.21 which establishes the 
supermodularity of polynomial functions that lie in suitable cones. We need one 
more piece of terminology. Let D := {1, . . . , and for < k < d let (^) be the set 
of all /c-subsets of D. A fc-dimensional subtensor of a d-dimensional tensor 

F = : l<^l,...,^d<n) E ®dM" 

is any of the (f)^'^^'^ tensors T G ^fclR" obtained from F by choosing I G [^), 
letting each index ij with j G / vary from 1 to n, and fixing each index ij with j ^ I 
at some value between 1 and n. For instance, the /c-dimensional tensor obtained by 
choosing I = {1, . . . ,k} and fixing some values 1 < ik+i, id < n is 

T = (Ti,,...,i, := : I < ii, . . . ,ik < n) G ®fcM" . 

For an integer m x n matrix A, let lCd{A) C (^f^M" be the cone of those tensors 
F such that, for all 2 < /c < d, every /c- dimensional subtensor of F is in ^^^{A). 

Lemma 5.2 Let A he integer mxn matrix. Let f : — )■ M 5e degree d form given 
by f{x) = (F, with F G )Cd{A). Let x G M" be nonnegative and Yll=il-''rg^ 

conformal sum in M" with g^ G Q{A) distinct and /ir > nonnegative scalars. Then 

A := (^f(^X + J2flrg'^-f{x)^ - J2(f(x + flrgn-f{x)) > 0. 
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Proof. To simplify the derivation we assume that all /x^ = 1- The same argument 
goes through in exactly the same way for arbitrary nonnegative Hr- For r = 1, . . . ,t, 

= {F, g"" (g) X ■ ■ ■ (g) x) + ■■■ + (F, X O ■ ■ ■ (g) ,T (g) 

i ( g"-, iel 



X, i i I 



k=2 ^ ^ ^ 

Similarly, 

fix + i29l - fix) = (f, ^d(^x + J2 9'^ |) - ®^^) 

t t 
= (F, ^ (g) X (g) ■ ■ ■ (g) x) + • • • + (F, X (g) ■ ■ ■ (g) X (g) ^ g' ) 

r=l r=l 



fc=2 

Therefore, 



k=2 I r=l ^ ^ 



X, i ^ I ' \ X, i ^ I 

Now, consider any 2 < k < d and any / G (^) . For simplicity of the indexation, we 
assume that I = {1, . . . ,k}. The derivation for other I is completely analogous. For 
each choice of indices 1 < ik+i, ■■■ ,id < n let T(ifc+i, . . . ,id) be the A;- dimensional 
subtensor of F obtained by letting ii, . . . ,ik vary and fixing ik+i, ■ ■ ■ ,id as chosen. 
Then the corresponding summand of A in the expression ( 1T6|1 above satisfies 

F,®k(j29j®i®d-kx)\ - J2^F,i®kgl®i(^d-kx)) (17) 

\ \r=l J I r=l 

n n j / * \ * 

= ■ ■ ■ 5Z ^^fe+i ■ ■ ■ ( Tiik+u • • • , ^rf), ®fc ( ) ~ XI 

jfc+i=l id=l \ \r=l / r=l 

The summand in f|T7|) above which corresponds to 1 < ■ ■ ■ lid^n satisfies 

r(zfe+i, . . . , ^rf), ® J ^ / ) - ^ ®kg'\ = (18) 

\r=l / r=l / 

^ {(T(?fc+i, . . . ,Zd),5''"' (g) ■ ■ ■ (g) : 1 < ri, . . . ,rfc < t, not all the same} . 
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Now, because all the g'"" are in the same orthant, and all fc- dimensional subtensors 
of F lie in the dual Graver cone V^A), each summand on the right-hand side of 
f|T8|) above satisfies {T{ik+i, . . . ,id),g^^ (S) • • ■ g^'') > 0, and so the left-hand side 
of ( |T8l) is nonnegative as well. Because x G M" is nonnegative, each summand on 
the right-hand side of ( |T7|) above is nonnegative, and so the left-hand side of ( fT7|) is 
nonnegative as well. Because this holds for all 2 < k < d and all / G (^) , we obtain 
that each summand on the right-hand side of (fT6l) is nonnegative, and so A > as 
claimed. □ 

A second key ingredient is the following analogue of Lemma EH] which shows that 
we can efficiently minimize a given form of any fixed degree in a given direction. 

Lemma 5.3 For every fixed d, there is an algorithm that, given l,u & ZJ^, z, G Z*^ 
with I < z <u, and f{x) = {F, ®dx) with F G ®dZ", solves in polynomial time 

min{/(z + fig) : fi E Ij-^- , I < z + fig < u}. (19) 

Proof. Let S := {fi E : I < z + fig < u}, and let s := sup S which is easy to 
determine. If s = cxd then we conclude that 5* is infinite and stop. Otherwise we need 
to minimize the univariate degree d polynomial h{fi) := {F, ®d{.z+ fig)) = Yl'i=o 
whose coefficients hi can be easily computed from F, over 5 = {0, 1, ... , s}. 

Outline: use repeated bisections and Sturm's theorem which allows us to count 
the number of real roots of h in any interval using the Euclidean algorithm on 
^(/^) = Sf=o ^«/^' derivative h'{fi) = X]to(^ + ^)hi+ifi\ to find intervals 

[rj,Sj], i = l,...,d (possibly with repetitions if h has multiple roots) containing 
each real root of h, and such that Sj — < 1 for all i. Then minimizing h over S 
reduces to minimizing h over 5* fl {0, [ri] , [sij , . . . [r^] , [s^J , s}. □ 

We can now establish our theorem on polynomial integer minimization. 

Theorem II. 41 For every fixed d there is an algorithm that, given integer mxn matrix 
A, its Graver basis Q{A), b G Z™, and degree d integer homogenous polynomial 
f{x) = {F, <^dx) with F G lCd{A), solves in polynomial time the polynomial program 

min{/(x) = {F, : x G Z"" , Ax = b , x > 0}. 

Proof. First, use the algorithm of Lemma [3.61 to either detect that the problem is 
infeasible or that the feasible set is infinite and stop, or obtain a feasible point and 
continue. Now, apply the algorithm of Lemma [3.51 precisely as it is, using the given 
form f{x) instead of a quadratic. Lemmas 15.21 and 15.31 now assure that the analy- 
sis of this algorithm in the proof of Lemma 13.51 carries through precisely as before, 
and guarantee that the algorithm will find an optimal solution in polynomial time. □ 
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